ARITHMETIC PROGRESSIONS IN SETS WITH SMALL SUMSETS

JÓZSEF SOLYMOSI


Abstract. We present an elementary proof that if $A$ is a finite set of numbers,
and the sumset $A +_G A$ is small, $|A +_G A| < c|A|$, along a dense graph $G$, then
$A$ contains $k$-term arithmetic progressions.


1. Introduction 

A well known theorem of Szemerédi [?] states that every dense subset of integers
contains long arithmetic progressions. A different, but somewhat related result
of Freiman [7] says that if the sumset of a finite set of numbers $A$ is small, i.e.
$|A + A| < C|A|$, then $A$ is a subset of a (not very large) generalized arithmetic
progression. Balog and Szemerédi proved in [?] that a similar structural statement
holds under weaker assumptions. (For correct statements and details, see [0].) As a
corollary of their result, Freiman’s theorem, and Szemerédi’s theorem about $k$-term
arithmetic progressions, Balog and Szemerédi proved Theorem 1 below. The goal
of this paper is to present a simple, purely combinatorial proof of this assertion.

Let A be a set of numbers and G be a graph such that the vertex set of G is A. 
The sumset of A along G is 

$$A +_G A = \{a + b : a, b \in A \text{ and } (a, b) \in E(G)\}.$$

Theorem 1. For every $c, K, k > 0$ there is a threshold $n_0 = n_0(c, K, k)$ such that
if $|A| = n > n_0$, $|A +_G A| < K|A|$, and $|E(G)| > cn^2$, then $A$ contains a $k$-term
arithmetic progression.
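To make the definition of the sumset along a graph concrete, here is a minimal Python sketch. The function name `sumset_along_graph`, the example set, and both edge lists are illustrative assumptions and do not come from the paper.

```python
from itertools import combinations_with_replacement


def sumset_along_graph(edges):
    """Return A +_G A = {a + b : (a, b) in E(G)} for an edge list on A."""
    return {a + b for a, b in edges}


# Hypothetical example: A is an arithmetic progression and G is the complete
# graph with loops, so A +_G A is the full sumset A + A.
A = list(range(0, 20, 2))
complete_edges = list(combinations_with_replacement(A, 2))
full = sumset_along_graph(complete_edges)
print(len(A), len(full))

# A sparser graph: keep only the pairs whose sum is divisible by 4.
sparse_edges = [(a, b) for a, b in complete_edges if (a + b) % 4 == 0]
print(len(sumset_along_graph(sparse_edges)))
```

For an arithmetic progression the full sumset has $2|A| - 1$ elements, which is the extreme case of a small sumset; restricting to a subgraph can only shrink $A +_G A$.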


2. Lines and hyperplanes 

There are arrangements of $n$ lines on the Euclidean plane such that the number
of points incident with at least three lines is of order $n^2$. Not much is known about
the structure of arrangements where the number of such points is close to the
maximum, say $cn^2$, where $c$ is a positive constant. Nevertheless, the following is
true.

Lemma 1. For every $c > 0$ there is a threshold $n_0 = n_0(c)$ and a positive $\delta = \delta(c)$
such that, for any set of $n > n_0$ lines $L$ and any set of $m > cn^2$ points $P$, if
every point is incident to three lines, then there are at least $\delta n^3$ triangles in the
arrangement. (A triangle is a set of three distinct points from $P$ such that any two
are incident to a line from $L$.)

Proof. This lemma follows from the following theorem of Ruzsa and Szemerédi.

Theorem 2 [?]. Let $G$ be a graph on $n$ vertices. If $G$ is the union of $cn^2$
edge-disjoint triangles, then $G$ contains at least $\delta n^3$ triangles, where $\delta$ depends
on $c$ only.

To prove Lemma 1, let us construct a graph where $L$ is the vertex set, and two
vertices are adjacent if and only if the corresponding lines cross at a point of $P$.
This graph is the union of $cn^2$ edge-disjoint triangles, since every point of $P$
defines a unique triangle, so we can apply Theorem 2. □
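The construction in this proof can be sketched in Python. This is only an illustration under stated assumptions: the helper names `line_graph_from_points` and `count_triangles`, the line labels `("x", a)`, `("y", b)`, `("s", t)`, and the choice of a full $6 \times 6$ grid with the lines $x = a$, $y = b$, $x + y = t$ are all hypothetical.

```python
from itertools import combinations


def line_graph_from_points(points_to_lines):
    """Vertices are lines; two lines are adjacent if they cross at a point of P."""
    edges = set()
    for lines in points_to_lines.values():
        for u, v in combinations(sorted(lines), 2):
            edges.add((u, v))
    return edges


def count_triangles(edges):
    """Count triangles of the graph; each one is seen once per edge."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return sum(len(adj[u] & adj[v]) for u, v in edges) // 3


# Hypothetical arrangement: every grid point lies on exactly three lines.
A = range(6)
P = {(a, b): [("x", a), ("y", b), ("s", a + b)] for a in A for b in A}
E = line_graph_from_points(P)
print(len(P), len(E), count_triangles(E))
```

Each point of $P$ contributes one triangle on its three lines, and these triangles are edge-disjoint because two distinct lines meet in at most one point. The graph contains further triangles that do not come from a single point, and these are what Theorem 2 counts.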


The result above suffices to prove Theorem 1 for 3-term arithmetic progressions. 
But for larger values of k, we need a generalization of Lemma 1. 

Lemma 2. For every $c > 0$ and $d > 2$, there is a threshold $n_0 = n_0(c, d)$ and a
positive $\delta = \delta(c, d)$ such that, for any set of $n > n_0$ hyperplanes $L$ and any set of
$m > cn^d$ points $P$, if every point is incident to $d + 1$ hyperplanes, then there are
at least $\delta n^{d+1}$ simplices in the arrangement. (A simplex is a set of $d + 1$ distinct
points from $P$ such that any $d$ are incident to a hyperplane from $L$.)

Lemma 2 follows from the Frankl-Rödl conjecture [?], the generalization of
Theorem 2. The $d = 3$ case was proved in [?] and the conjecture has been proved
recently by Gowers [?] and independently by Nagle, Rödl, Schacht, and Skokan
[2], [?]. For details on how Lemma 2 follows from the Frankl-Rödl conjecture, see [?].

3. The $k = 3$ case

Let $A$ be a set of numbers and $G$ be a graph such that the vertex set of $G$ is $A$.
We define the difference-set of $A$ along $G$ as

$$A -_G A = \{a - b : a, b \in A \text{ and } (a, b) \in E(G)\}.$$

Lemma 3. For every $\varepsilon, c, K > 0$ there is a number $D = D(\varepsilon, c, K)$ such that if
$|A +_G A| < K|A|$ and $|E(G)| > c|A|^2$, then there is a graph $G' \subset G$ such that
$|E(G')| > (1 - \varepsilon)|E(G)|$ and $|A -_{G'} A| < D|A|$.

Proof. Let us consider the arrangement of points given by a subset of the
Cartesian product $A \times A$ and the lines $y = a$, $x = a$ for every $a \in A$, and $x + y = t$ for
every $t \in A +_G A$. The pointset $P$ is defined by $(a, b) \in P$ iff $(a, b) \in E(G)$. By
Lemma 1, the number of triangles in this arrangement is at least $\delta n^3$. The triangles
here are right isosceles triangles. We say that a point in $P$ is popular if the point is
the right-angle vertex of at least $\alpha n$ triangles. For a suitable choice of $\alpha$ in terms
of $\delta(\cdot)$, the function from Lemma 1, all but at most $\varepsilon c n^2$ points of $P$ are popular.

A difference $t \in A - A$ is popular if $|\{(a, b) : a - b = t;\ a, b \in A\}| > \alpha n$. The
number of popular $t$'s is at most $Dn$, where $D$ depends on $\alpha$ only. $A \times A$ is a
Cartesian product, therefore every triangle can be extended to a square by adding one
extra point from $A \times A$. Every popular point $p$ is the right-angle vertex of at least
$\alpha n$ triangles. Therefore $p$ is incident to a line $x - y = t$, where $t$ is popular,
because this line contains at least $\alpha n$ “fourth” vertices of squares with $p$. □
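The bound on the number of popular differences is a counting argument: there are $n^2$ ordered pairs $(a, b)$, and each popular $t$ uses up at least $\alpha n$ of them, so $D = 1/\alpha$ suffices. A minimal Python sketch follows; the function name `popular_differences` and the test set are hypothetical, and the threshold is taken as "at least $\alpha n$" for simplicity.

```python
from collections import Counter


def popular_differences(A, alpha):
    """Differences t with at least alpha * |A| representations t = a - b."""
    n = len(A)
    reps = Counter(a - b for a in A for b in A)
    return {t for t, r in reps.items() if r >= alpha * n}


# Hypothetical set: a short progression plus a few scattered elements.
A = [0, 1, 2, 3, 4, 5, 6, 7, 20, 41, 83, 167]
alpha = 0.25
popular = popular_differences(A, alpha)

# Pair counting: (number of popular t) * alpha * n <= n^2.
assert len(popular) <= len(A) / alpha
print(sorted(popular))
```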


Proof of Theorem 1, case $k = 3$. Let us apply Lemma 1 to the pointset $P'$
defined by $(a, b) \in P'$ iff $(a, b) \in E(G')$, where the lines are $y = a$ for every $a \in A$,
$x - y = t$ for every $t \in A -_{G'} A$, and $x + y = s$ for every $s \in A +_G A$. By Theorem
2, if $|A|$ is large enough, then there are triangles in the arrangement. The vertices
of such triangles are vertices from $P' \subset A \times A$. The vertical lines through the
vertices form a 3-term arithmetic progression and therefore $A$ contains $\delta' n^2$ 3-term
arithmetic progressions, where $\delta' > 0$ depends on $c$ only. □
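The geometric step can be checked directly: if $p_1 = (x_1, b)$ and $p_2 = (x_2, b)$ lie on a horizontal line, the third vertex on the lines $x - y = x_1 - b$ and $x + y = x_2 + b$ is $\left(\frac{x_1 + x_2}{2},\ b + \frac{x_2 - x_1}{2}\right)$, so the first coordinates form a 3-term progression. The Python sketch below enumerates such triangles; the function name and the example set are hypothetical, and $G'$ is taken to be complete purely for illustration.

```python
def three_term_aps_from_triangles(P):
    """Triangles with sides on lines y = b, x - y = t, x + y = s give 3-term APs.

    P is a set of integer points (x, y). For p1 = (x1, b) and p2 = (x2, b) on
    a common horizontal line, the apex is ((x1 + x2)/2, b + (x2 - x1)/2).
    """
    aps = set()
    for (x1, b) in P:
        for (x2, b2) in P:
            if b2 != b or x2 <= x1 or (x2 - x1) % 2:
                continue
            apex = ((x1 + x2) // 2, b + (x2 - x1) // 2)
            if apex in P:
                aps.add((x1, apex[0], x2))
    return aps


A = [1, 2, 4, 8, 9, 10]
P = {(a, b) for a in A for b in A}  # G' complete, for illustration only
print(sorted(three_term_aps_from_triangles(P)))
```

Every triple returned is a 3-term arithmetic progression in $A$, read off from the vertical lines through the three vertices of a triangle.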


4. The general, $k > 3$, case

Following the steps of the proof for $k = 3$, we prove the general case by induction
on $k$. We prove the following theorem, which was conjectured by Erdős and proved
by Balog and Szemerédi in [?]. Theorem 3, together with the $k = 3$ case, gives a
proof of Theorem 1.

Theorem 3. For every $c > 0$ and $k > 3$ there is an $n_0$ such that, if $A$ contains at
least $c|A|^2$ 3-term arithmetic progressions and $|A| > n_0$, then $A$ contains a $k$-term
arithmetic progression.

Instead of triangles, we must consider simplices. Set $k = d$. In the $d$-dimensional
space we show that $A \times \cdots \times A$, the $d$-fold Cartesian product of $A$, contains a simplex
in which the vertices’ first coordinates form a $(d + 1)$-term arithmetic progression.

The simplices we are looking for are homothetic¹ images of the simplex $S_d$ whose
vertices are listed below:

(0, 0, 0, 0, ..., 0, 0)
(1, 1, 0, 0, ..., 0, 0)
(2, 0, 1, 0, ..., 0, 0)
(3, 0, 0, 1, ..., 0, 0)
⋮
(d-1, 0, ..., 1, 0)
(d, 0, 0, 0, ..., 0, 0).

An important property of $S_d$ is that its facets can be pushed into a “shorter” grid.
The facets of $S_d$ are parallel to hyperplanes defined by the origin (0, 0, 0, 0, ..., 0, 0)
and some $(d - 1)$-tuples of the grid

$$\{0, 1, 2, \ldots, d - 1\} \times \{-1, 0, 1\} \times \{0, 1\}^{d-2}.$$

For example, if $d = 3$, then the facets are

{(0,0,0), (1,1,0), (2,0,1)}
{(0,0,0), (1,1,0), (3,0,0)}
{(0,0,0), (2,0,1), (3,0,0)}
{(1,1,0), (2,0,1), (3,0,0)},

and the corresponding parallel planes in

$$\{0, 1, 2\} \times \{-1, 0, 1\} \times \{0, 1\}$$

are the planes incident to the triples

{(0,0,0), (1,1,0), (2,0,1)}
{(0,0,0), (1,1,0), (2,0,0)}
{(0,0,0), (2,0,1), (2,0,0)}
{(0,0,0), (1,-1,1), (2,-1,0)}.

¹ Here we say that two simplices are homothetic if the corresponding facets are parallel.

In general, if a facet of $S_d$ contains the origin and the “last point” (d, 0, 0, 0, ..., 0, 0),
then if we replace the latter one by (d-1, 0, 0, 0, ..., 0, 0), the new $d$-tuples define
the same hyperplane. The remaining facet $f$, given by

(1, 1, 0, 0, ..., 0, 0)
(2, 0, 1, 0, ..., 0, 0)
(3, 0, 0, 1, ..., 0, 0)
⋮
(d-1, 0, ..., 1, 0)
(d, 0, 0, 0, ..., 0, 0),

is parallel to the hyperplane through the vertices of $f - (1, 1, 0, 0, \ldots, 0, 0)$,

(0, 0, 0, 0, ..., 0, 0)
(1, -1, 1, 0, ..., 0, 0)
(2, -1, 0, 1, ..., 0, 0)
⋮
(d-2, -1, ..., 1, 0)
(d-1, -1, 0, 0, ..., 0, 0).
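For $d = 3$ the claim that each facet of $S_3$ is parallel to a plane through a triple of the shorter grid can be verified with cross products. The following Python sketch does this; the helper names `sub`, `cross`, `normal`, and `parallel` are illustrative, and the two lists of triples are copied from the $d = 3$ example in the text.

```python
from itertools import combinations


def sub(p, q):
    return tuple(a - b for a, b in zip(p, q))


def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def normal(triple):
    """Normal vector of the plane through three points of Z^3."""
    p, q, r = triple
    return cross(sub(q, p), sub(r, p))


def parallel(n1, n2):
    """Two nonzero normals are parallel iff their cross product vanishes."""
    zero = (0, 0, 0)
    return n1 != zero and n2 != zero and cross(n1, n2) == zero


S3 = [(0, 0, 0), (1, 1, 0), (2, 0, 1), (3, 0, 0)]
facets = list(combinations(S3, 3))  # same order as the list in the text
pushed = [
    ((0, 0, 0), (1, 1, 0), (2, 0, 1)),
    ((0, 0, 0), (1, 1, 0), (2, 0, 0)),
    ((0, 0, 0), (2, 0, 1), (2, 0, 0)),
    ((0, 0, 0), (1, -1, 1), (2, -1, 0)),
]
for f, g in zip(facets, pushed):
    print(f, "is parallel to the plane through", g, ":",
          parallel(normal(f), normal(g)))
```

The last facet, the one that misses the origin, has normal vector $(1, 2, 1)$, and so does the plane through its translate by $-(1, 1, 0)$.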

In a homothetic copy of the grid

$$T_d = \{0, 1, 2, \ldots, d - 1\} \times \{-1, 0, 1\} \times \{0, 1\}^{d-2},$$

the image of the origin is called the holder of the grid.

As the induction hypothesis, let us suppose that Theorem 3 is true for a $k > 3$
in a stronger form, providing that the number of $k$-term arithmetic progressions in
$A$ is at least $c|A|^2$.

Then the number of distinct homothetic copies of $T_d$ in $A^d = A \times \cdots \times A$ (the
$d$-fold product) is at least $c'|A|^{d+1}$ ($c'$ depends on $c$ only). Let us say that a point
$p \in A^d$ is popular if $p$ is the holder of at least $\alpha|A|$ grids. If $p$ is popular, then
for any facet $f$ of $S_d$, the point $p$ is the element of at least $\alpha|A|$ $d$-tuples, similar
and parallel to $f$. If $\alpha$ is small enough, then at least $\gamma|A|^d$ points of $A^d$ are
popular, where $\gamma$ depends on $c$ and $\alpha$ only.

A hyperplane $H$ is $\beta$-rich if it is incident to many points, $|H \cap A^d| > \beta|A|^{d-1}$.
For every facet $f$ of $S_d$, let us denote the set of $\beta$-rich hyperplanes which are
parallel to $f$ by $\mathcal{H}_f$.

Lemma 4. For some choice of $\beta$, at least half of the popular points are incident
to $d + 1$ $\beta$-rich hyperplanes, parallel to the facets of $S_d$.

Suppose to the contrary that for a facet $f$, more than […] popular points are
not incident to hyperplanes of $\mathcal{H}_f$. Then more than

[…] (1)

$d$-tuples, similar and parallel to $f$, are not covered by $\mathcal{H}_f$. Let us denote the
hyperplanes incident to the “uncovered” $d$-tuples by $L_1, L_2, \ldots, L_m$, and the number of
points on the hyperplanes by $\ell_1, \ell_2, \ldots, \ell_m$. A simple result of Elekes and Erdős
implies that hyperplanes with few points cannot cover many $d$-tuples.

Theorem 4. The number of homothetic copies of $f$ in $L_i$ is at most […],
where $c_d$ depends on $d$ only.

The inequalities

$$\sum_{i=1}^{m} \ell_i \le |A|^d \quad \text{and} \quad \ell_i < \beta|A|^{d-1}$$

lead us to the proof of Lemma 4.

The number of $d$-tuples covered by the $L_i$'s is at most […].
If we compare this bound to (1), and choose $\beta$ such that […], then
at least half of the popular points are covered by $d + 1$ $\beta$-rich hyperplanes parallel
to the facets of $S_d$. □


Finally we can apply Lemma 2 with the pointset $P$ of “well-covered” popular
points of $A^d$ and with the set of hyperplanes $L = \bigcup_{f \subset S_d} \mathcal{H}_f$. The number of
points is at least $\frac{\gamma}{2}|A|^d$. For a given $f$, $|\mathcal{H}_f| < |A|^d / (\beta|A|^{d-1}) = |A|/\beta$. The
number of hyperplanes in $L$ is at most $(d + 1)|A|/\beta$. By Lemma 2, we have at least
$\delta'|A|^{d+1}$ homothetic copies of $S_d$ in $A^d$. Let us project them onto $x_1$, the first
coordinate axis. Every image is a $(k + 1)$-term arithmetic progression, and the
multiplicity of one image is at most $|A|^{d-1}$. Therefore there are at least $\delta'|A|^2$
$(k + 1)$-term arithmetic progressions in $A$. □
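The projection step can be illustrated on a small example. The Python sketch below enumerates the copies $p + \lambda S_d$ that lie in $A^d$ and records the progression formed by their first coordinates. It assumes the reading of the vertex list in which the $i$-th vertex is $i e_1 + e_{i+1}$ for $1 \le i \le d - 1$, which matches the $d = 3$ example; the function names and the set $A = \{0, \ldots, 4\}$ are hypothetical.

```python
from collections import Counter
from itertools import product


def simplex_vertices(d):
    """Vertices of S_d: 0, i*e_1 + e_{i+1} for 1 <= i <= d-1, and d*e_1."""
    verts = [tuple([0] * d)]
    for i in range(1, d):
        v = [0] * d
        v[0], v[i] = i, 1
        verts.append(tuple(v))
    verts.append(tuple([d] + [0] * (d - 1)))
    return verts


def projected_progressions(A, d):
    """For each (d+1)-term AP, count the copies p + lam*S_d in A^d above it."""
    A = set(A)
    S = simplex_vertices(d)
    images = Counter()
    for p in product(sorted(A), repeat=d):
        for lam in range(1, max(A) - min(A) + 1):
            copy = [tuple(pi + lam * vi for pi, vi in zip(p, v)) for v in S]
            if all(c in A for q in copy for c in q):
                images[tuple(q[0] for q in copy)] += 1
    return images


A = range(5)
images = projected_progressions(A, 3)
for ap, mult in sorted(images.items()):
    print(ap, mult)
```

Each image is determined by the first coordinate of $p$ and by $\lambda$, while the remaining $d - 1$ coordinates of $p$ are free, which is why one progression has at most $|A|^{d-1}$ preimages.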


5. $G_n = K_n$

When the full sumset $A + A$ is small then it is easier to prove that $A$ contains
long arithmetic progressions. We can use the following Plünnecke type inequality [?].

Theorem 5. Let $A$ and $B$ be finite subsets of an abelian group such that $|A| = n$
and $|A + B| < \delta n$. Let $k > 1$ and $l > 1$. Then

$$|kB - lB| < \delta^{k+l} n.$$

It follows from the inequality that for any dimension $d$ and $d$-dimensional integer
vector $v = (x_1, \ldots, x_d)$, $x_i \in \mathbb{Z}$, there is a $c > 0$ depending on $d$, $\delta$ and $v$ such that
the following holds: If $|A + A| < \delta|A|$, then $A^d$ can be covered by $c|A|$ hyperplanes
with the same normal vector $v$. Using this, we can define our hyperplane-point
arrangement, with the hyperplanes parallel to the facets of $S_d$ containing at least
one point of $A^d$, and the pointset of the arrangement is $A^d$. Then we do not have
to deal with rich planes and popular points, and we can apply Lemma 2 directly.
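The covering statement can be read as a count of distinct values of $\langle v, a \rangle$ over $a \in A^d$: each value corresponds to one hyperplane with normal vector $v$. The Python sketch below compares a set with small sumset against a set with large sumset. The function name `hyperplanes_covering`, the normal vector $(1, 2, 1)$, and both test sets are illustrative assumptions.

```python
from itertools import product


def hyperplanes_covering(A, v):
    """Distinct values of <v, a> over a in A^d; each value is one hyperplane."""
    d = len(v)
    return {sum(x * a for x, a in zip(v, point))
            for point in product(A, repeat=d)}


v = (1, 2, 1)                         # illustrative normal vector, d = 3
progression = list(range(12))         # |A + A| = 2|A| - 1
sparse = [2 ** i for i in range(12)]  # |A + A| is quadratic in |A|
print(len(hyperplanes_covering(progression, v)))
print(len(hyperplanes_covering(sparse, v)))
```

For the progression the number of hyperplanes grows linearly in $|A|$, as the Plünnecke type bound predicts, while for the sparse set it is much larger.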